Image capture and marking

ABSTRACT

A digital image is scanned to generate image data, which is processed to steganographically embed plural-bit auxiliary data. The system is useful, e.g., for capturing and marking digital images without noticeably changing their visual appearance.

RELATED APPLICATION DATA

This application is a division of application Ser. No. 09/417,153, filed Oct. 12, 1999, which is a continuation of application Ser. No. 09/317,784, filed May 24, 1999 now U.S. Pat. No. 6,072,888, which is a continuation of application Ser. No. 09/074,632, filed May 7, 1998 now U.S. Pat. No. 5,930,377, which is a continuation of application Ser. No. 08/969,072, filed Nov. 12, 1997 now U.S. Pat. No. 5,809,160, which is a continuation of application Ser. No. 07/923,841, filed Jul. 31, 1992 now U.S. Pat. No. 5,721,788. The foregoing applications are incorporated by reference herein.

TECHNICAL FIELD

This invention relates to a method of and system for encoding a signature into a digital image.

BACKGROUND OF THE INVENTION

Various images in traditional print or photographic media are commonly distributed to many users. Examples include the distribution of prints of paintings to the general public and photographs and film clips to and among the media. Owners may wish to audit usage of their images in print and electronic media, and so require a method to analyze print, film and digital images to determine if they were obtained directly from the owners or derived from their images. For example, the owner of an image may desire to limit access or use of the image. To monitor and enforce such a limitation, it would be beneficial to have a method of verifying that a subject image is copied or derived from the owner's image. The method of proof should be accurate and incapable of being circumvented. Further, the method should be able to detect unauthorized copies that have been resized, rotated, cropped, or otherwise altered slightly.

In the computer field, digital signatures have been applied to non-image digital data in order to identify the origin of the data. For various reasons these prior art digital signatures have not been applied to digital image data. One reason is that these prior art digital signatures are lost if the data to which they are applied are modified. Digital images are often modified each time they are printed, scanned, copied, or photographed due to unintentional “noise” created by the mechanical reproduction equipment used. Further, it is often desired to resize, rotate, crop or otherwise intentionally modify the image. Accordingly, the existing digital signatures are unacceptable for use with digital images.

SUMMARY OF THE INVENTION

The invention includes a method and system for embedding image signatures within visual images, applicable in the preferred embodiments described herein to digital representations as well as other media such as print or film. The signatures identify the source or ownership of images and distinguish between different copies of a single image.

In a preferred embodiment described herein, a plurality of signature points are selected that are positioned within an original image having pixels with pixel values. The pixel values of the signature points are adjusted by an amount detectable by a digital scanner. The adjusted signature points form a digital signature that is stored for future identification of subject images derived from the image.

The foregoing and other features of the invention will be more readily apparent from the following detailed description, which proceeds with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram of a computer system used in a preferred embodiment of the present invention.

FIG. 2 is a sample digital image upon which a preferred embodiment of the present invention is employed.

FIG. 3 is a representation of a digital image in the form of an array of pixels with pixel values.

FIG. 4 is graphical representation of pixel values showing relative minima and maxima pixel values.

FIG. 5 is a digital subject image that is compared to the image of FIG. 2 according to a preferred embodiment of the present invention.

DETAILED DESCRIPTION

The present invention includes a method and system for embedding a signature into an original image to create a signed image. A preferred embodiment includes selecting a large number of candidate points in the original image and selecting a number of signature points from among the candidate points. The signature points are altered slightly to form the signature. The signature points are stored for later use in auditing a subject image to determine whether the subject image is derived from the signed image.

The signatures are encoded in the visible domain of the image and so become part of the image and cannot be detected or removed without prior knowledge of the signature. A key point is that while the changes manifested by the signature are too slight to be visible to the human eye, they are easily and consistently recognizable by a common digital image scanner, after which the signature is extracted, interpreted and verified by a software algorithm.

In contrast to prior art signature methods used on non-image data, the signatures persist through significant image transformations that preserve the visible image but may completely change the digital data. The specific transforms allowed include resizing the image larger or smaller, rotating the image, uniformly adjusting color, brightness and/or contrast, and limited cropping. Significantly, the signatures persist through the process of printing the image to paper or film and rescanning it into digital form.

Shown in FIG. 1 is a computer system 10 that is used to carry out an embodiment of the present invention. The computer system 10 includes a computer 12 having the usual complement of memory and logic circuits, a display monitor 14, a keyboard 16, and a mouse 18 or other pointing device. The computer system also includes a digital scanner 20 that is used to create a digital image representative of an original image such as a photograph or painting. Typically, delicate images, such as paintings, are converted to print or film before being scanned into digital form. In one embodiment a printer 22 is connected to the computer 12 to print digital images output from the processor. In addition, digital images can be output in a data format to a storage medium 23 such as a floppy disk for displaying later at a remote site. Any digital display device may be used, such a common computer printer, X-Y plotter, or a display screen.

An example of the output of the scanner 20 to the computer 12 is a digital image 24 shown in FIG. 2. More accurately, the scanner outputs data representative of the digital image and the computer causes the digital image 24 to be displayed on the display monitor 14. As used herein “digital image” refers to the digital data representative of the digital image, the digital image displayed on the monitor or other display screen, and the digital image printed by the printer 22 or a remote printer.

The digital image 24 is depicted using numerous pixels 24 having various pixel values. In the gray-scale image 24 the pixel values are luminance values representing a brightness level varying from black to white. In a color image the pixels have color values and luminance values, both of which being pixel values. The color values can include the values of any components in a representation of the color by a vector. FIG. 3 shows digital image 24A in the form of an array of pixels 26. Each pixel is associated with one or more pixel values, which in the example shown in FIG. 3 are luminance values from 0 to 15.

The digital image 24 shown in FIG. 2 includes thousands of pixels. The digital image 24A represented in FIG. 3 includes 225 pixels. The invention preferably is used for images having pixels numbering in the millions. Therefore, the description herein is necessarily a simplistic discussion of the utility of the invention.

According to a preferred embodiment of the invention numerous candidate points are located within the original image. Signature points are selected from among the candidate points and are altered to form a signature. The signature is a pattern of any number of signature points.

In a preferred embodiment, the signature is a binary number between 16 and 32 bits in length. The signature points may be anywhere within an image, but are preferably chosen to be as inconspicuous as possible. Preferably, the number of signature points is much greater than the number of bits in a signature. This allows the signature to be redundantly encoded in the image. Using a 16 to 32 bit signature, 50-200 signature points are preferable to obtain multiple signatures for the image.

A preferred embodiment of the invention locates candidate points by finding relative maxima and minima, collectively referred to as extrema, in the image. The extrema represent local extremes of luminance or color. FIG. 4 shows what is meant by relative extrema. FIG. 4 is a graphical representation of the pixel values of a small portion of a digital image. The vertical axis of the graph shows pixel values while the horizontal axis shows pixel positions along a single line of the digital image. Small undulations in pixel values, indicated at 32, represent portions of the digital image where only small changes in luminance or color occur between pixels. A relative maximum 34 represents a pixel that has the highest pixel value for a given area of the image. Similarly, a relative minimum 36 represents a pixel that has the lowest pixel value for a given area of the image.

Relative extrema are preferred signature points for two major reasons. First, they are easily located by simple, well known processing. Second, they allow signature points to be encoded very inconspicuously.

One of the simplest methods to determine relative extrema is to use a “Difference of Averages” technique. This technique employs predetermined neighborhoods around each pixel 26; a small neighborhood 28 and a large neighborhood 30, as shown in FIGS. 2 and 3. In the present example the neighborhoods are square for simplicity, but a preferred embodiment employs circular neighborhoods. The technique determines the difference between the average pixel value in the small neighborhood and the average pixel value of the large neighborhood. If the difference is large compared to the difference for surrounding pixels then the first pixel value is a relative maxima or minima.

Using the image of FIG. 3 as an example, the Difference of Averages for the pixel 26A is determines as follows. The pixel values within the 3.times.3 pixel small neighborhood 28A add up to 69; dividing by 9 pixels gives an average of 7.67. The pixel values within the 5.times.5 pixel large neighborhood 30A add up to 219; dividing by 25 pixels gives an average of 8.76 and a Difference of Averages of −1.09. Similarly, the average in small neighborhood 28G is 10.0; the average in large neighborhood 30G is 9.8; the Difference of Averages for pixel 26G is therefore 0.2. Similar computations on pixels 26B-26F produce the following table:

26A 26B 26C 26D 26E 26F 26G Small Neighborhood 7.67 10.56 12.89 14.11 13.11 11.56 10.0 Large Neighborhood 8.76 10.56 12.0 12.52 12.52 11.36 9.8 Difference of −1.09 0.0 0.89 1.59 0.59 0.2 0.2 Averages

Based on pixels 26A-26G, there may be a relative maximum at pixel 26D, whose Difference of Averages of 1.59 is greater than the Difference of Averages for the other examined pixels in the row. To determine whether pixel 26D is a relative maximum rather than merely a small undulation, its Difference of Averages must be compared with the Difference of Averages for the pixels surrounding it in a larger area.

Preferably, extrema within 10% of the image size of any side are not used as signature points. This protects against loss of signature points caused by the practice of cropping the border area of an image. It is also preferable that relative extrema that are randomly and widely spaced are used rather than those that appear in regular patterns.

Using the Difference of Averages technique or other known techniques, a large number of extrema are obtained, the number depending on the pixel density and contrast of the image. Of the total number of extrema found, a preferred embodiment chooses 50 to 200 signature points. This may be done manually by a user choosing with the keyboard 16, mouse 18, or other pointing device each signature point from among the extrema displayed on the display monitor 14. The extrema may be displayed as a digital image with each point chosen by using the mouse or other pointing device to point to a pixel or they may be displayed as a list of coordinates which are chosen by keyboard, mouse, or other pointing device. Alternatively, the computer 12 can be programmed to choose signature points randomly or according to a preprogrammed pattern.

One bit of binary data is encoded in each signature point in the image by adjusting the pixel values at and surrounding the point. The image is modified by making a small, preferably 2%-10% positive or negative adjustment in the pixel value at the exact signature point, to represent a binary zero or one. The pixels surrounding each signature point, in approximately a 5.times.5 to 10.times.10 grid, are preferably adjusted proportionally to ensure a continuous transition to the new value at the signature point. A number of bits are encoded in the signature points to form a pattern which is the signature for the image.

In a preferred embodiment, the signature is a pattern of all of the signature points. When auditing a subject image, if a statistically significant number of potential signature points in the subject image match corresponding signature points in the signed image, then the subject image is deemed to be derived from the signed image. A statistically significant number is somewhat less than 100%, but enough to be reasonably confident that the subject image was derived from the signed image.

In an alternate embodiment, the signature is encoded using a redundant pattern that distributes it among the signature points in a manner that can be reliably retrieved using only a subset of the points. One embodiment simply encodes a predetermined number of exact duplicates of the signature. Other redundant representation methods, such as an error-correcting code, may also be used.

In order to allow future auditing of images to determine whether they match the signed image, the signature is stored in a database in which it is associated with the original image. The signature can be stored by associating the bit value of each signature point together with x-y coordinates of the signature point. The signature may be stored separately or as part of the signed image. The signed image is then distributed in digital form.

As discussed above, the signed image may be transformed and manipulated to form a derived image. The derived image is derived from the signed image by various transformations, such as resizing, rotating, adjusting color, brightness and/or contrast, cropping and converting to print or film. The derivation may take place in multiple steps or processes or may simply be the copying of the signed image directly.

It is assumed that derivations of these images that an owner wishes to track include only applications which substantially preserve the resolution and general quality of the image. While a size reduction by 90%, a significant color alteration or distinct-pixel-value reduction may destroy the signature, they also reduce the image's significance and value such that no auditing is desired.

In order to audit a subject image according to a preferred embodiment, a user identifies the original image of which the subject image is suspected of being a duplicate. For a print or film image, the subject image is scanned to create a digital image file. For a digital image, no scanning is necessary. The subject digital image is normalized using techniques as described below to the same size, and same overall brightness, contrast and color profile as the unmodified original image. The subject image is analyzed by the method described below to extract the signature, if present, and compare it to any signatures stored for that image.

The normalization process involves a sequence of steps to undo transformations previously made to the subject image, to return it as close as possible to the resolution and appearance of the original image. It is assumed that the subject image has been manipulated and transformed as described above. To align the subject image with the original image, a preferred embodiment chooses three or more points from the subject image which correspond to points in the original image. The three or more points of the subject image are aligned with the corresponding points in the original image. The points of the subject image not selected are rotated and resized as necessary to accommodate the alignment of the points selected.

For example, FIG. 5 shows a digital subject image 38 that is smaller than the original image 24 shown in FIG. 2. To resize the subject image, a user points to three points such as the mouth 40B, ear 42B and eye 44B of the subject image using the mouse 18 or other pointer. Since it is usually difficult to accurately point to a single pixel, the computer selects the nearest extrema to the pixel pointed to by the user. The user points to the mouth 40A, ear 42A, and eye 44A of the original image. The computer 12 resizes and rotates the subject image as necessary to ensure that points 40B, 42B, and 44B are positioned with respect to each other in the same way that points 40A, 42A, and 44A are positioned with respect to each other in the original image. The remaining pixels are repositioned in proportion to the repositioning of points 40B, 42B and 44B. By aligning three points the entire subject image is aligned with the original image without having to align each pixel independently.

After the subject image is aligned, the next step is to normalize the brightness, contrast and/or color of the subject image. Normalizing involves adjusting pixel values of the subject image to match the value-distribution profile of the original image. This is accomplished by a technique analogous to that used to align the subject image. A subset of the pixels in the subject image are adjusted to equal corresponding pixels in the original image. The pixels not in the subset are adjusted in proportion to the adjustments made to the pixels in the subset. The pixels of the subject image corresponding to the signature points should not be among the pixels in the subset. Otherwise any signature points in the subject image will be hidden from detection when they are adjusted to equal corresponding pixels in the original image.

In a preferred embodiment, the subset includes the brightest and darkest pixels of the subject image. These pixels are adjusted to have luminance values equal to the luminance values of corresponding pixels in the original image. To ensure that any signature points can be detected, no signature points should be selected during the signature embedding process described above that are among the brightest and darkest pixels of the original image. For example, one could use pixels among the brightest and darkest 3% for the adjusting subset, after selecting signature points among less than the brightest and darkest 5% to ensure that there is no overlap.

When the subject image is fully normalized, it is preferably compared to the original image. One way to compare images is to subtract one image from the other. The result of the subtraction is a digital image that includes any signature points that were present in the subject image. These signature points, if any, are compared to the stored signature points for the signed image. If the signature points do not match, then the subject image is not an image derived from the signed image, unless the subject image was changed substantially from the signed image.

In an alternative embodiment, the normalized subject image is compared directly with the signed image instead of subtracting the subject image from the original image. This comparison involves subtracting the subject image from the signed image. If there is little or no image resulting from the subtraction, then the subject image equals to the signed image, and therefore has been derived from the signed image.

In another alternate embodiment, instead of normalizing the entire subject image, only a section of the subject image surrounding each potential signature point is normalized to be of the same general resolution and appearance as a corresponding section of the original image. This is accomplished by selecting each potential signature point of the subject image and selecting sections surrounding each potential signature point. The normalization of each selected section proceeds according to methods similar to those disclosed above for normalizing the entire subject image.

Normalizing each selected section individually allows each potential signature point of the subject image to be compared directly with a corresponding signature point of the signed image. Preferably, an average is computed for each potential signature point by averaging the pixel value of the potential signature point with the pixel values of a plurality of pixels surrounding the potential signature point. The average computed for each signature is compared directly with a corresponding signature point of the signed image.

While the methods of normalizing and extracting a signature from a subject image as described above are directed to luminance values, similar methods may be used for color values. Instead of or in addition to normalizing by altering luminance values, the color values of the subject image can also be adjusted to equal corresponding color values in an original color image. However, it is not necessary to adjust color values in order to encode a signature in or extract a signature from a color image. Color images use pixels having pixel values that include luminance values and color values. A digital signature can be encoded in any pixel values regardless of whether the pixel values are luminance values, color values, or any other type of pixel values. Luminance values are preferred because alterations may be made more easily to luminance values without the alterations being visible to the human eye.

From the foregoing it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims. 

We claim:
 1. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the processing results in steganographic encoding of the auxiliary data into regions of the image having substantially non-uniform pixel values.
 2. The method of claim 1 wherein: the image data comprises rows of pixels, each pixel having at least a luminance value associated therewith; the encoding results in increasing the luminance values of at least certain of said pixels; and the encoding further results in decreasing the luminance values of at least certain others of said presented pixels.
 3. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the processing includes processing in accordance with a pseudo-random parameter.
 4. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the processing distributes energy of the encoded data through the image and adapts the energy distribution in accordance with local characteristics of the image, so as to make the image encoding relatively less visible.
 5. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein local changes to image data are scaled in accordance with local pixel values.
 6. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein: the image data comprises rows of pixels, each pixel having at least a luminance value associated therewith; the encoding results in increasing the luminance values of at least certain of said pixels; the encoding further results in decreasing the luminance values of at least certain others of said presented pixels; and a change of a single bit of said plural-bit auxiliary data between first and second values is manifested as changed luminance in plural non-contiguous pixels located in plural non-contiguous rows of said digital image.
 7. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the encoding includes processing plural M-row by M-column areas of adjoining pixels in said image data.
 8. The method of claim 7 where 5 pixels<=M<=10 pixels.
 9. The method of claim 7 in which a change of a single of said plural bits of auxiliary data between first and second values is manifested as changed luminance in pixels in plural of said M-column by M-row areas.
 10. The method of claim 7 wherein certain M-row by M-column areas of pixels are determined to be unsuitable for encoding, so said encoding does not result in changed luminance values for pixels within said certain areas.
 11. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the encoding is area-based, with each area spanning M contiguous rows and M contiguous columns of pixels, where M is at least five.
 12. The method of claim 11 wherein M is no greater than ten.
 13. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the processing includes encoding said plural-bit auxiliary data in a redundant fashion permitting error correction.
 14. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the image data is represented by plural elements, each having a parameter value, and the encoding includes changing the parameter values of said elements to encode the auxiliary data therein, and scaling a magnitude of said changes in accordance with said parameter values.
 15. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the processing includes identifying portions of the image where encoding might be relatively more apparent, and avoiding said portions when encoding.
 16. A method of operating an image capture device, comprising: presenting a physical subject to a photosensor array to produce image data corresponding to said subject; and processing said image data to steganographically encode plural bit auxiliary data therein, thereby yielding encoded image data, the steganographic encoding locally changing the luminance of the image data; wherein the image data comprises plural data elements, the method including selecting data elements to be changed to effect said encoding based not just on the values of individual data elements, but on an attribute of a neighborhood of data elements, said neighborhood being comprised of data elements having substantially non-uniform values.
 17. The method of claim 16 that includes changing said selected data elements based, at least in part, on initial values thereof.
 18. The method of claim 16 wherein at least certain excerpts of the image are not marked because marking in such excerpts would more likely be conspicuous.
 19. An image capture device for capturing and outputting image data, the image capture device comprising plural opto-electronic sensors, and a processor coupled thereto for inserting steganographic information into image data output by said device, wherein the steganographic information locally changes the luminance of the image data, and energy of the steganographic information is distributed in accordance with local features of the image data.
 20. A scanner according to claim
 19. 